Phase modelling of speech excitation for low bit-rate sinusoidal transform coding
Abstract
Sinusoidal transform coding (STC) techniques model speech as the sum of sine-waves whose frequencies, amplitudes and phases are specified at regular intervals. To achieve a low bit-rate representation, only the spectral envelope is encoded and the phases are regenerated according to a minimum phase assumption. In this paper, the inaccuracy of the minimum phase model is demonstrated. It is shown that the phase spectra of decoded speech segments may be corrected using either the parameters of a Rosenberg pulse model or a second order all-pass filter. Experiments have shown that applying this correction increases the phase accuracy and improves the speech quality.
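As an illustration of the minimum phase assumption mentioned in the abstract, the following Python sketch regenerates phases from a magnitude envelope alone using the standard real-cepstrum folding method, then sums harmonically related sine-waves. It is not taken from the paper: the function names, the one-pole test envelope, and the frame parameters are assumptions chosen for the example, and the onset-time (linear phase) term used in practical STC decoders is left out, as are the Rosenberg pulse and all-pass corrections.

```python
import numpy as np


def minimum_phase_from_magnitude(mag):
    """Minimum-phase spectrum (radians) implied by a magnitude spectrum.

    `mag` is assumed to be sampled on a full, even-length FFT grid and to be
    symmetric, as it is for a real-valued signal.
    """
    n = len(mag)
    log_mag = np.log(np.maximum(mag, 1e-12))  # guard against log(0)
    cepstrum = np.fft.ifft(log_mag).real      # real cepstrum
    # Fold the cepstrum onto non-negative quefrencies (causal cepstrum).
    fold = np.zeros(n)
    fold[0] = 1.0
    fold[1:n // 2] = 2.0
    fold[n // 2] = 1.0
    # The imaginary part of the resulting log spectrum is the phase.
    return np.fft.fft(cepstrum * fold).imag


def synthesize_voiced_frame(mag, f0, fs, num_samples):
    """Sum harmonics of f0 with envelope amplitudes and minimum phases."""
    n = len(mag)
    phase = minimum_phase_from_magnitude(mag)
    t = np.arange(num_samples) / fs
    frame = np.zeros(num_samples)
    k = 1
    while k * f0 < fs / 2:
        bin_index = int(round(k * f0 * n / fs))  # nearest envelope sample
        frame += mag[bin_index] * np.cos(2 * np.pi * k * f0 * t + phase[bin_index])
        k += 1
    return frame


# Hypothetical test envelope: a one-pole filter, which is minimum phase,
# so the regenerated phase should match its true phase.
n_fft = 1024
omega = 2 * np.pi * np.arange(n_fft) / n_fft
h = 1.0 / (1.0 - 0.5 * np.exp(-1j * omega))
est_phase = minimum_phase_from_magnitude(np.abs(h))
max_err = np.max(np.abs(est_phase - np.angle(h)))
frame = synthesize_voiced_frame(np.abs(h), f0=100.0, fs=8000.0, num_samples=160)
```

For an envelope that really is minimum phase, as in this test case, the regenerated phase agrees with the true phase. The abstract's point is that the speech excitation does not satisfy this assumption, which is why a correction term is proposed.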
Similar resources
Dispersion phase vector quantization for enhancement of waveform interpolative coder
This paper presents an efficient analysis-by-synthesis vector quantizer for the dispersion phase of the excitation signal which was used to enhance a waveform-interpolative coder. The scheme can be used to enhance other harmonic coders, such as the sinusoidal-transform coder and the multiband-excitation coder. The scheme incorporates perceptual weighting, and does not require any phase unwrapping...
A mixed sinusoidally excited linear prediction coder at 4 kb/s and below
There is currently a great deal of interest in the development of speech coding algorithms capable of delivering toll quality at 4 kb/s and below. For synthesizing high quality speech, accurate representation of the voiced portions of speech is essential. For bit rates of 4 kb/s and below, conventional Code Excited Linear Prediction (CELP) may likely not provide the appropriate degree of period...
High quality MELP coding at bit-rates around 4 kb/s
Recently, a number of coding techniques have been reported to achieve near toll quality synthesized speech at bit-rates around 4 kb/s. These include variants of Code Excited Linear Prediction (CELP), Sinusoidal Transform Coding (STC) and Multi-Band Excitation (MBE). While CELP has been an effective technique for bit-rates above 6 kb/s, STC, MBE, Waveform Interpolation (WI) and Mixed Excitation ...
A Flexible Multirate Speech Coder
This paper describes algorithms which provide the capability of parametrically coding speech using Sinusoidal Transform Coding (STC) at a variety of bit rates and transforming the coded bit stream to a lower-rate bit stream without interaction with the source, through a set of techniques called parameter space transformations. Parameter space transformations are a generalization...
Variable bit-rate sinusoidal transform coding using variable order spectral estimation
Sinusoidal transform coding (STC) is known to be capable of producing good communication quality speech coded at bit rates below 4 kb/s. Discrete all-pole modelling (DAP) is an alternative spectral estimation method which can be more accurate than the conventional linear prediction (LP) analysis normally used by STC. In the quest to achieve the highest possible speech quality at lower and lower a...
Publication date: 1997